PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Csa03g031770.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; malvids; Brassicales; Brassicaceae; Camelineae; Camelina
Family HB-other
Protein Properties Length: 1722 aa    MW: 193678 Da    pI: 4.8884
Description HB-other family protein
Gene Model
Gene Model ID     Type      Source    Coding Sequence
Csa03g031770.2    genome    CSGP      View CDS
Signature Domain
No.   Domain     Score   E-value   Start   End   HMM Start   HMM End
1     Homeobox   60.5    2.7e-19   41      96    2           57
                    T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHHC CS
        Homeobox  2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk 57
                     kR+  t+ qle+Le+++ +++yps+++r++L++kl+Lt+rq ++WF+ rR k+kk
  Csa03g031770.2 41 PKRQMKTPFQLETLEKVYSEEKYPSEATRADLSDKLNLTDRQLQMWFCHRRLKDKK 96
                    69****************************************************98 PP
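
The alignment above can be summarized as a simple identity fraction between the HMM consensus row and the query row. The following is a minimal Python sketch, assuming the two rows are copied exactly as shown and that the alignment contains no gaps (true for this hit). The variable names are illustrative, and this crude identity count is not the HMM bit score reported in the table.

```python
# Consensus and query rows copied from the alignment above (no gaps in either row).
hmm_consensus = "rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakekk"
query = "PKRQMKTPFQLETLEKVYSEEKYPSEATRADLSDKLNLTDRQLQMWFCHRRLKDKK"

# Both rows must cover the same number of alignment columns.
assert len(hmm_consensus) == len(query)

# Compare case-insensitively: in HMMER-style output, upper case in the
# consensus row only marks strongly conserved positions.
matches = sum(h.upper() == q for h, q in zip(hmm_consensus, query))
identity = matches / len(query)

print(f"{matches}/{len(query)} identical positions ({identity:.1%})")
```

The count of identical positions should agree with the number of letters in the match line printed between the two rows, since that line shows a letter only where the query residue equals the consensus.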

Protein Features
3D Structure
Database          Entry ID             E-value    Start   End    InterPro ID   Description
Gene3D            G3DSA:1.10.10.60     7.8E-19    23      96     IPR009057     Homeodomain-like
SuperFamily       SSF46689             2.48E-16   32      97     IPR009057     Homeodomain-like
PROSITE profile   PS50071              16.536     37      97     IPR001356     Homeobox domain
SMART             SM00389              5.0E-18    39      101    IPR001356     Homeobox domain
Pfam              PF00046              7.4E-17    41      96     IPR001356     Homeobox domain
CDD               cd00086              1.23E-14   42      96     No hit        No description
PROSITE profile   PS50827              18.031     558     617    IPR018501     DDT domain
SMART             SM00571              4.6E-24    558     617    IPR018501     DDT domain
Pfam              PF02791              9.3E-18    559     614    IPR018501     DDT domain
Pfam              PF05066              9.1E-16    740     807    IPR007759     HB1/Asxl, restriction endonuclease HTH domain
Pfam              PF15612              7.8E-7     940     981    IPR028942     WHIM1 domain
Pfam              PF15613              4.5E-13    1115    1187   IPR028941     WHIM2 domain
Gene Ontology
GO Term      GO Category          GO Description
GO:0006355   Biological Process   regulation of transcription, DNA-templated
GO:0003677   Molecular Function   DNA binding
Sequence
Protein Sequence    Length: 1722 aa
MEMGSDEEED QIRSVADVVA GSNNNKKKNK IDNSSSSSAK PKRQMKTPFQ LETLEKVYSE  60
EKYPSEATRA DLSDKLNLTD RQLQMWFCHR RLKDKKDDQS QSKTPVKPAV PAAVRPPPPA  120
FASSVNDLPP ARSVPEQDSG SGSDSGSGCS PYSDSRRNFA SGSSSSRAEL DEYETMVKPS  180
YEPRLSAMVR RAIVCIEAQL GEPLRDDGPI LGMEFDPLPP GAFGSPIAMQ KHLLHPYESK  240
MYEPHDVRPR RSQAAARSFH EQQSLDDPSS FTPEMYGRYS ENHAHGMDYE IARPRSSSFM  300
HENGSLPRSY GTPGYVSRNC STSQQDMPSP IVASAHRGDR FLMEKDSSVL GTEDPYMLSD  360
GVHKSNDVHR KGKIHDVRLG RGSETRENRG PKDLEKLEIQ KKKNEERMRK EMERNERERR  420
KEEERLMRER IKEEERLQRE QRREMERREK FLQRENERAE KKKQKEEIRR EKDAIRRKIA  480
IEKATARRIA KESMDLIEDE QLELMELAAI SKGLPSVLQL DHDTLQNLEL YRDSLSTFPP  540
KGLQLKMPFA ISPWKDSDES VGNLLMVWRF LTSFSDVLDL WPFTLDEFIQ AFHDYDSRLL  600
GEIHVTLLRS IIRDIEDVAR TPFSGIGNNQ YTTANPEGGH PQIVEGAYAW GFDIRSWKKN  660
LNPLTWPEIL RQLALSTGLG PRLKKKSSRF THTGDKDEAK GCEDIISTIR SGSAAESAFA  720
LMREKGLLAP RKSRHRLTPG TVKFAAFHVL SLEGSKGLTV LELADKIQKS GLRDLTTSKT  780
PEASISVALT RDVKLFERIA PSTYCVRAPY VKDPADGEAI LADARKKIRA FESGLTGPED  840
VNDLERDEDF EIDIDEDPEV DDLATLASAS KSADLDEANV LSGKGGDTMF CDVKAGVKSE  900
IEKEFSSPPP SSIKSIAPQH NERLKDTAVG CVDAMVDESN EGQSWIQGLT EGDYCHLSVE  960
ERLNALVALV GIANEGNSIR AGLEDRMEAA NSLKKQMWAE AQLDNSCMRD VLKLDFQNLA  1020
SSKTESTMGL PIIQSSNRER DNFGGDPSEL LDEKKPLEVV SNDLQKSTAE RGLINQEAII  1080
SQENCSFQQG YVSKRSRSQL KSYIGHKAEE VYPYRSLPVG QDRRHNRYWL FAASASKSDP  1140
SSGLLFVELH DGKWLLIDSE EAFDTLVASL DMRGIRESHL RIMLQKIEGS FKENARKNMK  1200
LARNPFLKEK SVMNHSPTDS VSPSSAVSGS NSDSMETSNS IRVELGRNDT EKKSLSKRFH  1260
DFQRWMWTET YSSLPSCAKK YGKKRSELLA TCALCVASYL SEYTHCTSCH QRLDMVDDSE  1320
ILDSGLTVSP LPFGVRLLKP LLVFLEASIP DEALESFWTE DKRKIWGFRL NASSSPEEAL  1380
QVLTTLETAI KKEYLSSNFM SAKELLGVGD ADADDPGSVD VLPWIPKTVS AVALRLSELD  1440
ASIIYVKPEK PDLIPEDETE QISLFPGDSL FKGKGPREQE DQDEVVPNLG NRRSNKRARV  1500
SLGSGSNKKV KRKKAQGGPN RFVVSQRNVA VDNNLMSMEL NHQIPGRGKR TVRKRPERIN  1560
EENDHLVNRM ADIVRPKTQE VEEDEEEEEQ TFRDIDEDWA AGETPREMDE DWANETPNRM  1620
TPMQVDDESD NSVGVESEDD DVDGQFVDYS QRNKWGLDWN SNANEAAMED EEEEEVVGVE  1680
RVEGEDDAEI SESSEDDDDV PANNAANNYD RESEGYSSSD S*
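
The listing above groups residues in blocks of ten, with a running position count at the end of each full line and a trailing * marking the stop. A minimal Python sketch for flattening such a listing into a plain string follows. The function name parse_sequence_block is hypothetical and not part of PlantTFDB, and the sample holds only the first two lines of the listing.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join residue blocks, dropping position counters and the '*' stop marker."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            # Keep tokens made only of letters (optionally ending in '*');
            # this skips the numeric counters such as "60" or "120".
            if re.fullmatch(r"[A-Za-z]+\*?", token):
                residues.append(token.rstrip("*"))
    return "".join(residues).upper()


# First two lines of the protein sequence listing above.
sample = """\
MEMGSDEEED QIRSVADVVA GSNNNKKKNK IDNSSSSSAK PKRQMKTPFQ LETLEKVYSE  60
EKYPSEATRA DLSDKLNLTD RQLQMWFCHR RLKDKKDDQS QSKTPVKPAV PAAVRPPPPA  120
"""

seq = parse_sequence_block(sample)

# Homeobox hit from the Signature Domain table: residues 41 to 96,
# 1-based and inclusive, so the Python slice is [40:96].
homeobox = seq[41 - 1:96]

print(len(seq))
print(homeobox)
```

The extracted slice can be compared against the query row of the signature domain alignment as a sanity check that the coordinates are 1-based and inclusive.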
Nuclear Localization Signal
NLS
No.   Start   End   Sequence
1     89      96    RRLKDKKD
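
The motif RRLKDKKD can be located in the protein sequence listed earlier with a plain substring search. The sketch below uses only the first 120 residues of that sequence, and the variable names are illustrative. Under Python's 0-based indexing the motif spans indices 89 to 96, which corresponds to residues 90 to 97 when counted from 1, so the Start and End values listed here appear to be 0-based. That reading is an inference from the data on this page, not documented PlantTFDB behavior.

```python
# First 120 residues of the protein sequence listed earlier (enough to cover the motif).
seq_prefix = (
    "MEMGSDEEEDQIRSVADVVAGSNNNKKKNKIDNSSSSSAKPKRQMKTPFQLETLEKVYSE"
    "EKYPSEATRADLSDKLNLTDRQLQMWFCHRRLKDKKDDQSQSKTPVKPAVPAAVRPPPPA"
)
motif = "RRLKDKKD"

start0 = seq_prefix.find(motif)   # 0-based start index, -1 if the motif is absent
end0 = start0 + len(motif) - 1    # 0-based inclusive end index

print("0-based inclusive:", start0, end0)
print("1-based inclusive:", start0 + 1, end0 + 1)
```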
Regulation -- PlantRegMap
Source        Upstream Regulator   Target Gene
PlantRegMap   Retrieve             -
Annotation -- Nucleotide
Source    Hit ID     E-value   Description
GenBank   AC010155   0.0       AC010155.3 Genomic sequence for Arabidopsis thaliana BAC F3M18 from chromosome I, complete sequence.
GenBank   CP002684   0.0       CP002684.1 Arabidopsis thaliana chromosome 1 sequence.
Annotation -- Protein
Source      Hit ID                             E-value   Description
Refseq      XP_010499372.1                     0.0       PREDICTED: uncharacterized protein LOC104776906
Swissprot   F4HY56                             0.0       RLT1_ARATH; Homeobox-DDT domain protein RLT1
TrEMBL      D7KCW8                             0.0       D7KCW8_ARALL; HB-1
STRING      fgenesh2_kg.1__3015__AT1G28420.1   0.0       (Arabidopsis lyrata)
Best hit in Arabidopsis thaliana
Hit ID        E-value   Description
AT1G28420.1   0.0       homeobox-1